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.SPECIFICATION e., the word that the author has mistakenly substituted 
for the intended word. 

When a sentence contains a confused word, natural language parsers 
have difficulty parsing the sentence . A natural language parser 
analyzes sentences of a natural language to discern the lexical 
and syntactic content of the sentences . For example, a chart-based 
natural language parser retrieves a dictionary entry from a 
dictionary for each word in the input sentence . The dictionary entry 
contains a lexical record containing general information about the word 
and referencing. . . 
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.SPECIFICATION in several respects. These distinctions are highlighted 
below. 

The first preferred embodiment also includes a parser 22 for parsing 
an input natural language query into its constituent parts. The 

parser 22 uses a grammar table 24 and a terminology dictionary 26. The 
grammar table 24 holds information for regulating the relation in a 

Japanese sentence, and the terminology. . . 
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Detailed Description 

705, and that a user has entered the query. 

j@ Who produces AcmeNet ' . 

either in natural language or as a conceptual graph . 

This query will be processed to produce the following sernterms . 



Request Sernterms 

Who produces AcmeNet ? evt(E. 
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Detailed Description 

Claims 

English Abstract 

A natural language information querying system includes an indexing 
facility configured to automatically generate indices of updated textual 
sources based on one or more predefined grammars and a database coupled 
to the indexing facility to store the indices for subsequent searching 



Detailed Description 

information include television feeds, cable feeds, radio feeds, and 
computer generated multimedia feeds. Text-based search engines 
typically cannot search these multimedia sources of information. 

2 

SUMMARY 

A natural language information querying system includes an indexing 
facility configured to automatically generate indices of dynamically 
updated text sources based on a predefined grammar and a database 
coupled to the indexing facility to store the indices. 

Implementations of the invention include a query engine coupled to the 
database to respond to a natural language query. 



In another aspect, a method for providing information in response to a 
natural language query , includes extracting information from an 
updated text corpus based on a predefined I 0 grammar ; and creating a 
stored indexed text corpus adapted to permit natural language 
querying . 

Implementations of the above aspect include one or more of the following. 
The method. . . 

...application, among others. 

In another aspect, a system for providing information in response to a 
natural language query includes an information extraction engine 
adapted to index an automatically updated text corpus based on a 
predefined grammar ; a database coupled to the information extraction 
engine to store the index output; and a natural language query 
engine coupled to the database to search the index in response to the 
natural language ... SBC in the Communications Services Sector" or "Did 
Barnes & Noble buy anyone this year?" The natural language query is 
parsed by the natural language user interface I 10. A variety of 
natural language parsers can be used. In one embodiment, the 
natural language interface I 10 executes one or more query grammar 
files which are analogous to the grammar files 3 10 associated with the 
search... the language of the natural language reply. For instance, a 
German user can enter a natural language query in Gennan. The 
German query can be parsed by a grammar set up to parse German 
queries , and the resulting query can be applied to documents that can 
be in Japanese, English, and German, or any. . . 

Claim 

1 A system for providing information in response to a natural language 
query , Comprising: an infonnation extraction engine adapted to index 

an updated text source based on a 
predefined grammar ; 

a database coupled to the information extraction engine to store the 
index output; and a natural language query engine coupled to the 
database to search the index for the text corpus in response to the 
natural language query. 

2 The system of claim 1, further comprising a data acquisition unit 
coupled to... wherein the client has a pen input engine adapted to convert 
user handwritings to the natural language query . 
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Detailed Description 

... such as computer languages, many effective solutions are available. 
However, the complexity and ambiguity of natural languages have made 
them resistant to efficient parsing . 

Automated sentence parsing devices usually general purpose digital 
computers -- have operated by matching parts of the input sentence 
with a very large number of stored rules. If the first rule does not 
apply. . . 
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Detailed Description 
. . . major 

components (application programs) that will be described in 
detail below: a user interface, a parser , and a sentence 
evaluator that determines the extent to which .a given sentence 
answers a submitted question. 

The user begins the process of retrieving 

information by submitting a natural language questions (e.g., 
English) to" the user interface. For example, the user might 
ask, "When ... 
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language ambiguity. 
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Internet-based information search support system stores conceptual 
structure obtained by analyzing ontology that is extracted corresponding 
to selected service of browser, with details of properties associated 
with concepts 
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NOVELTY - An extraction unit extracts an ontology corresponding to 
selected service of browser. A conceptual structure obtained by 
analyzing ontology, is stored in a memory, with details of properties 
associated with concepts . One display unit displays the conceptual 
structure for browser, and another display unit displays details of 
property associated with concept selected from displayed structure, for 
browser . 

DETAILED DESCRIPTION - INDEPENDENT CLAIMS are also included for the 
following : 

(1) application server; 

(2) information search method; and 

(3) program product for causing computer to perform information 
search process. 

USE - For providing user interface to define relationship between 
concepts extracted by analyzing ontology such as entertainment 
ontology, delivery service ontology and event ontology, and properties, 
through internet . 

ADVANTAGE - Provide user interface to enable inputting user's high 
level request and to dynamically update ontology depending on service. 

DESCRIPTION OF DRAWING (S) - The figure shows the flowchart 
illustrating the information search process. 
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Abstract (Basic): WO 200263493 Al 

NOVELTY - System uses logical and semantic structure inherent in 
the document for concept recognition to generate conceptual graphs . 
This in turn is used for iterative testing of concept groups as 
possible conceptual centers of semantic relevance, and those concepts 
that are more central or peripheral within a document are determined to 
create conceptual relations . 

DETAILED DESCRIPTION - There are INDEPENDENT CLAIMS for: 

(1) A system for generating a knowledge base 

(2) A system for automatically generating document summaries 

(3) A system for generating context-dependent document directories 

(4) A system for categorizing documents 

(5) A search engine 

USE - System is for search , retrieval , categorization, summary 
and semantic processing of documents. 

ADVANTAGE - System is simple and transparent for users to operate. 
DESCRIPTION OF DRAWING (S) - The figure shows a flowchart of the 
process flow for summary generation, 
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The information processing and location procedure draws up a 
location request and applies the request to the set of documents using 
pre-established rules to obtain the required information. 

Each document is processed by an automatic indexing system 
organising the terms in the document in synonym classes associated with 
the indexing classes and establishing lexical and semantic links 
between the classes. A set of rules are created to filter the 
information, comprising simple semantic rules and conceptual 
association rules. A similarity function is created between the 
request and the document and used to decide whether document 
corresponds to the request. 

ADVANTAGE - Faster and cheaper location of information with 
improved thoroughness and precision. 
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Abstract (Basic) : WO 9216906 A 

The retrieval system (10) comprises a data base (13), an input 
device (11a) and an intelligent interface (12). The data base contains 
information which is to be retrieved . The input device enables a user 
to use his own natural language to input a query . 

The intelligent interface transforms the natural language 
query to the formal computer language of the system. The interface 
contains processing to solve the Stainer Problem where the conceptual 
graph representing the data base is unrestricted. 

ADVANTAGE - Provides new and improved way of solving Stainer 
Problem 
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The method and appts. for generating sentences generates more 
natural and easily comprehensible sentences by incorporating the 
selection of an appropriate choice for the order of words from a number 
of possible candidates in a process of sentence construction. 

Conceptual structures to be used as elements of sentences to be 
generated are stored (1). Rules to be followed in generating sentences 
are stored (2) . Sentences are generated from the stored conceptual 
structures in accordance with the stored rules by utilising sizes of 
partial structures of the conceptual structures specified by the stored 
rules to determine the structure of the sentence to be generated. 

USE/ADVANTAGE - Machine translation system or language processing 
system e.g. summary generation system, translation telephone system or 
question answering system. Generates more natural and easily 
comprehensible sentences. (13pp Dwg.No.1/8) 
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The hybrid natural language understanding (NLU) system is 
partic. designed for processing natural language text. Primary 
functional components of the NLU system include a preprocessor, a word 
look-up and morphology module which communicates with a lexicon and a 
learning module. A syntactic parser interfaces with an augmented 
transition network (ATN) grammar and a case frame applier, converts the 
syntactic structure into canonical, semantic case frames.' A discourse 
analysis component integrates explicit and implied information in the 
text into a conceptual structure which represents its meaning. 

This structure may be passed onto a knowledge based system, data 
base, to interested analysts or decision makers. Significant feedback 
points are provided such as the case frame applier may notify the 
syntactic parser of a semantically incorrect parse. 

USE/ADVANTAGE - Makes data-base more - 'user friendly 1 . 
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ABSTRACT 



PROBLEM TO BE SOLVED: To enable a user to specify a document in accordance 
with classification by generating the classification following a viewpoint. 

SOLUTION: A word is designated, retrieval is performed and a document 
group to be an object is selected. The document is subjected to a 
morphological analysis, words are extracted and a word under consideration 
is selected on the basis of the number of frequencies. The cooccurrence 
relations (cooccurrence graph) of the word under consideration are detected 
in the sentence unit of each document. A cooccurrence graph included in 
many sentences is selected as a conceptual graph , and the conceptual 
vector of a document to be written is decided as to whether to include the 
conceptual graph . A document is classified and displayed according to 

the conceptual vector so that the document can be selected. 
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Abstract: Generating tool specific schemata and configuration information 
for OLAP database tools from conceptual graphical models is an 
important prerequisite for a comprehensive tool support for computer aided 
data warehouse engineering (CAWE) . This paper describes the design and 
implementation of such a generation component in the context of our 
BabelFish data warehouse design tool environment. It identifies the 
principal issues that are involved in the design and implementation of 
such a component and discusses possible solutions. The paper lists typical 
mismatches between the data model of commercial OLAP tools and conceptual 

graphical modeling notations, and proposes methods to overcome these 
expressive differences during the generation process. Further topics are 
the use of graph grammars for specifying and parsing graphical MD 
schema descriptions and the integration of the generation process into a 
metadata centered modeling tool environment. 13 Refs. 
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Abstract: As information databases we consider all the kinds of 
information repositories that are handled by computer systems. When 
querying very large information databases, the end-users are often faced 
with the problem to parse their questions efficiently into the query 
languages of the computer systems. Conceptual graphs were initially 
designed for natural language analysis and understanding. Due to their 
closeness to semantic networks, their expressiveness is powerful enough to 
be applied to knowledge representation and use by computer systems. This 
work demonstrates that conceptual graphs are a suitable means to model both 
the information in patient databases and the queries to these databases, 
and that operations on graphs can compute the pattern matching process 
needed to provide the answers. A prototype that exploits this model is 
presented. Experiments have been made with the material furnished by the 
Unified Medical Language System project (version 2, 1992) of the National 
Library of Medicine, USA. (Author abstract) 23 Refs. 
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Abstract: Current hypertext systems have no intelligent means for 
finding specific information. When searching for specific information 
(as opposed to browsing), users can get disoriented in large hypertext 
documents and may end up following a path that takes them further away from 
the information they seek . This paper describes an information retrieval 

system called HRS (Hyper-text Retrieval System) that allows users to 
retrieve information in hypertext documents based on its semantic content. 
HRS is comprised of an authoring system, a browser, and a graph-based 
information retrieval facility. The graph-based retrieval facility 
allows users to retrieve specific information in hypertext documents by 
posing English language queries . The retrieval facility is based on the 
use of Conceptual Graphs, a knowledge representation scheme. The English 
language queries posed by users are automatically converted to Conceptual 
Graphs by a parser . The information in hypertext documents is also 
represented using Conceptual Graphs. Query processing is treated as a 
graph matching process, and retrieval is performed by a semantic based 
search . The technology is useful for retrieval of information in large 
knowledge domains where a user needs to find specific information and 
does not know the organisation of the hypertext document or the words used 
in the document . The paper concludes that natural language retrieval of 
information in hypertext documents can provide users with both the browsing 
capabilities of hypertext and the semantic search capabilities of natural 
language query processing. (Author abstract) 38 Refs. 
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This thesis aims at building a Lexical Knowledge Base (LKB) that will 
be useful to a Natural Language Processing (NLP) system by extracting 
information from a Machine Readable Dictionary (MRD) . Our source of 
knowledge is the American Heritage First Dictionary$\spl$ (AHFD) which 
contains 1800 entries and is designed for children of age six to eight 
learning the structure and the basic vocabulary of their language. Using a 
children's dictionary allows us to restrict our vocabulary, but still work 
on general knowledge about day to day concepts and actions. 

Our Lexical Knowledge Base contains information extracted from the 
AHFD and represented using the Conceptual Graph (CG) formalism. The 
graph definitions explicitly give the information contained in all the noun 
and verb definitions from the AHFD. Each sentence of each definition is 
tagged, parsed and automatically transformed into a conceptual graph 
. The type hierarchy, extracted automatically from the definitions, groups 
all the nouns and verbs in the dictionary into a taxonomy. Covert 
categories will be discovered among the definitions and will complement the 
type hierarchy in its role for establishing concept similarity. Covert 
categories can be thought of as concepts not associated to a dictionary 
entry, such as "writing instrument" or "device giving time". They allow 
grouping of words based on different criteria than a common hypernym, and 
therefore augment the space to explore for finding similarity among 
concepts. The relation hierarchy is built manually which groups into 
subclasses/superclasses the relations used in our CG representation of 
definitions. The relations can be prepositions such as in, on or with or 
deeper semantic relations such as part-of, material or instrument. Concept 
clusters are constructed automatically around a trigger word to put it into 
a larger context. Its graph representation is joined to the graph 
representations of other words in the dictionary that are related to it. 
The set of related words forms a concept cluster and their graph 
representation, showing all the relations between them and other related 
words, is a Concept Clustering Knowledge Graph. 

One important aspect of the thesis is the underlying thread of 
finding similarity through concept and graph comparison as a general way 
of processing information. 

The ideas presented in this thesis are implemented in a system 
ARC-Concept. We present and discuss the results obtained, 
f tn$\spl$Copyright$\sp\circler$ 1994 by Houghton Mifflin Company. 
Reproduced by permission from THE AMERICAN HERITAGE FIRST DICTIONARY. 
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BEELINE is an experimental computer program that aims to understand 
raw, paragraph-length natural language instructions and use them to 
navigate in a simulation of the physical world. The work describes the 
design, construction and testing of four components which together comprise 
BEELINE: A conceptual graph reasoning engine (CG Processor), an 
actor-based conceptual parser (SAVVY) , a goal- seeking rational agent 
heuristic (NAVI), and a physical world simulation (UNSWorld) . 

In developing BEELINE, some difficult problems of artificial 
intelligence are directly confronted. To overcome the system's potential 
disengagement from the world, the thesis offers theoretical and practical 
solutions to the problems of building, situating and grounding conceptual 

graph knowledge representations of the world and the language used to 
describe it. It is argued that language cannot be properly understood as an 
abstraction, but that it must be considered as only one source of knowledge 
informing a more broadly-defined cognitive system immersed in an 
environment. To be robust enough to deal with unprocessed text, the parser 

was designed to accept the potential contribution of, but not depend on, a 
syntactical chart parser using a constituent grammar . Semantic parsing 

processes are to be kept manageable by arbitrary bounding of text handling 
and inference. A new "four-pole" theory of semantics combines these 
elements into a model of knowledge interaction. The theory is then extended 
to allow behavioural control of the integrated rational agent, which 
interprets and executes the conceptual output of the parser . 

The active components are evaluated on engineering criteria, and their 
strengths and weaknesses are discussed. They are found to function 
adequately for the purpose at hand but improvements are suggested. By 
virtue of the agent's placement in a simulated environment, the machine's 
language capability can also be operationally evaluated using behavioural 
methods. Statistical measurements in a number of trials show that with real 
natural language directions, the agent can find a destination 
significantly more efficiently than it could in identical tasks without 
them. The evaluation experiments are examined critically with a view to 
improved future language-using agents. 
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The theory, design, and implementation of a shell for domain 
independent intelligent help systems, along with a survey of some 
intelligent help systems, is presented in this document. Typical 
intelligent help systems are dependent on the application domains since 
they have concrete low-level knowledge representation language for their 
knowledge base. This concrete language works in a limited area only, 
confining the flexibility of the knowledge base and causing a problem when 
the application domain is changed. In order to solve this problem, a system 
must have a flexible knowledge base. The proposed knowledge base 
incorporates two levels: a logical level which enables reasoning, and a 
heuristic level which preserves efficiency and flexibility. 

The proposed system is called UNIversal HELP (UNIHELP) . UNIHELP 
comprises the two levels and solves the knowledge acquisition bottleneck 
problem by using Direct Knowledge Acquisition Mechanism (DKAM) which works 
on both the logical and heuristic level. Consequently, it maintains a 
flexible knowledge base called UNIversal Knowledge Base (UNIK) . UNIK can be 
accessed from both levels of UNIHELP to achieve the maximum efficiency in 
terms of speed and quality of the answer. Unlike the knowledge acquisition 
mechanism which uses a fixed script, DKAM works for learning completely new 
concepts and allows us to put natural intelligence on the artificial 
intelligence systems without separation. 

Natural language query is translated into conceptual graphs 
and it is used as a key for retrieving the corresponding plan from the 
knowledge base. The retrieving is done by matching an input conceptual 

graph to the conceptual graphs in the knowledge base. An efficient 
conceptual graph matching algorithm is presented in this document. The 
request understander which uses a deterministic parsing technique, has 
advantages of both a knowledge-based system and a grammar -based system. 
An inferencing scheme which can be used in a distributed computing 
environment is presented. This way the system can be' extended into a 
distributed help system. 

A prototype shell is presented to demonstrate some salient features of 
the type of proposed intelligent help system in this thesis. 
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Abstract: During multidatabase (MBD) conceptual modelling, besides the 
main difficulties that usually appear in DB modelling, the designer has to 
solve new specific problems. These new aspects entail the necessity of 
overloading an existing conceptual model with a great amount of new 
information and capabilities dedicated to data distribution. Such a model 
becomes unattractive to the MDB designers and difficult to be used even by 
the DB specialists. To overcome a part of these problems that we have to 
solve during the development of a MDB CASE environment, we realized that 
the success of our project depends on: 1) the creation of a conceptual 
model that naturally unifies the underlying capabilities of more data 
models as well as the needed distribution capabilities; 2) a formalism to 
facilitate the MDB modelling knowledge acquisition and representation, as 
much as possible close to the human reasoning and understanding; 3) a tool 
whose capabilities allow us to implement this formalism in a very 
attractive way; 4) a tool to store, retrieve and process a great volume 
of information. We present our conclusions on 2), specifically on the 
metamodel we are going to implement for representing a multidatabase 
object-oriented model, previously created as 1) . We perform a parallel 
analysis between the main lexical structures (and their semantic features) 
met in the natural language (NL) representation and the semantics of the 
classical concepts used in the conceptual graph (CG) theory as well as 
the semantics of the new concepts for conceptual modelling. (10 Refs) 
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Abstract: The use of conceptual graphs in information retrieval and 

natural language applications has grown rapidly in popularity and 

sophistication over the past few years. Question answering/ retrieval 
systems and the linguistic domains are making considerable headway. 
However, the problem of acquiring, and subsequently reasoning with, 
conceptual information directly from normative natural language texts is 
still a considerable problem, for much necessary control information is 
found to be absent from the written text. The paper outlines one particular 
system, (NOMOS) {Giannetti et al.) designed for the retrieval of 
conceptual data directly from natural language texts, using the 

conceptual graph formalism for its representation. It also describes 

some of the difficulties encountered when the knowledge content of the 

retrieved data was validated by mapping it directly into an expert 
system shell (Advisor) , although most of the difficulties described would 
generally apply to any attempt to use purely textual data as the source of 
a knowledge base. It is our contention that, although the expressive power 
of the conceptual graph formalism is formidable for knowledge 
representation, simply representing the textual information within 
normative texts is insufficient to enable inferencing to be performed. (11 

Refs) 
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Abstract: While much research in natural language processing (NLP) has 
been devoted to microlevel analyses of constrained text, many applications, 
such as machine translation , message understanding and information 
retrieval , call for capabilities in the understanding of unconstrained 
text. The paper discusses one step toward this type of NLP system: the 
integration of Chomsky's government-binding (GB) theory of syntax (N. 
Chomsky, 1981) with Sowa ' s conceptual - graph (CG) theory of knowledge 
representation (J. Sowa, 1984) GB theory provides a parsing technology 
that surpasses that of phrase-structure grammars , and the CG theory 
offers a formalism that is suitable for handling natural-language semantics 
and pragmatics. Their marriage is most natural and synergistic. Not only 
can their respective strengths be enjoyed, but also most intermediate steps 
required to build GCs from parse trees can be eliminated, because of the 
fact that, when it is done independently, a great deal of common knowledge 
is required both for generating parse trees with a GB-based parser and 
for translating a parse tree into a CG representation. (28 Refs) 
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Abstract: For medical records, the challenge is natural language 
processing (NLP) of texts, and the construction of an adequate knowledge 
representation. The article describes the components of an NLP system, 
which is currently being. developed in the Geneva Hospital, and within the 
European Community's AIM programme. They are: a natural language 

analyser, a conceptual graphs builder, a data base storage component, 
a query processor, a natural language generator and, in addition, a 

translator , a diagnosis encoding system and a literature indexing 
system. Taking advantage of a closed domain of knowledge, defined around a 
medical specialty, a method called proximity processing has been developed. 
In this situation no parser of, the initial text is needed, and the system 
is based on semantical information of near words in sentences . The 
benefits are: easy implementation, portability between languages, 
robustness towards badly-formed sentences , and a sound representation 
using conceptual graphs . (17 Refs) 
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ABSTRACT: 

This thesis presents the approach of Kaleidoscope, a cooperative query- 
inter face for relieving the user's burden of learning and recalling. The 
objective of this grammar -driven menu guidance is to enable users to 
construct a meaningful query by recognizing choices that match their 
mental query . The central thesis of this work is that a data model plays 
a crucial role in the Kaleidoscope's style of interfaces, as a query 
language conveys the underlying conceptualization of data to the user. The 
design of grammar , lexicon, and query translator follows a formally 
defined data model. The major technical contribution of this thesis is a 
data model formalizing the conceptual structure of restricted English 
queries . 
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Set Items Description 

51 360 NATURAL {) (TEXT? OR LANGUAGE?) 

52 4 CONCEPT? () (STRUCTUR? OR NOTATION? OR GRAPH?) OR KIF OR KIFS 

OR CGIF? 

53 16281 QUERY OR QUERIES OR RETRIEV? OR INQUIR? OR SEARCH OR SEARC- 

HES OR SEARCHING OR SEARCHENGINE? OR SEEK? OR LOCAT? OR MATCH? 
OR FIND OR FINDS OR FINDING 

54 221 SENTENC? OR GRAMMAR? OR GRAMMATIC? OR STANDARD () STRUCTUR? 

55 19914 TRANSLAT? OR MAP OR MAPPING OR MAPPED OR INDEX? OR TRANSFO- 

RM? OR CREAT? OR PARS? 

56 4 9 S3 AND S4 AND S5 

57 10 S6 AND (SI OR S2) 

58 0 S6 AND S2 

59 10 S6 AND SI 

510 0 SI AND S2 

511 2 S2 AND S3 

512 1 Sll AND (S4 OR S5) 

513 12 S7 OR S9 OR Sll 

514 12 S13 NOT PY>2000 

515 5 S14 NOT PD>20001218 
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TITLE: Web search relief ahead; here's how results become more relevant 

KCSL's X-Portal, a Windows application, creates a virtual reference desk 
that answers users 1 questions from the Web. No longer need users suffer 
with sluggish, inaccurate, complicated software sold as ' search 
facilitators. 1 X-Portal developed from Ilia Kaufman's work on development 
of spell- and grammar -checking components of Microsoft Word, WordPerfect 
and other products. Kaufman, president of KCSL, applied natural language 

processing abilities to query interpretation and results ranking. 
X-Portal includes 22 reference works, including a dictionary, a thesaurus 



...These references are stored on the user's hard disk in compressed files. 
When a query is entered, X-Portal begins by searching through these 
resources and shows results immediately. If the user wants a definition or 
less than a page of information, it is returned instantly. The program 
continues searching relevant Web pages located by search engines, 
including HotBot, AltaVista, and Yahoo!. X-Portal analyzes relevance and 
the editorial quality of pages, using internal reference works as a 
yardstick. Search results are updated, with the most relevant Web pages 
inserted as discovered at the top of this list. Surfers can depend on 
receiving a highly filtered list of search results, and will find 
X-Portal' s research abilities equally effective for both general knowledge 
and technical information. 

DESCRIPTORS: Content Providers; Encyclopedias & Almanacs; IBM PC & 
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51 523 AU=(CHA K? OR CHA, K?) 

52 664 AU= (CHUNG E? OR CHUNG, E?) 

53 6208 AU=(LIM S? OR LIM, S?) 

54 6470 AU= ( KANG H? OR KANG, H?) 

55 0 SI AND S2 AND S3 AND S4 

56 410 (SI OR S2 OR S3 OR S4) AND (NATURAL () LANGUAGE? OR LEXICOG? 

OR LINGUIST? OR MORPHOLOG? OR SEMANTIC? OR GRAMMAR? ? OR GRAM- 
MATIC?) 

57 62 S6 AND (SEARCH? OR SEEK? OR RETRIEV? OR ENGINE? OR FIND? OR 

LOCAT? OR QUERY OR QUERIES) 
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Abstract: Among the HTML elements, HTML tables left bracket RHJ98 right 
bracket encapsulate hierarchically structured data (hierarchical data in 
short) in a tabular structure. HTML tables do not come with a rigid schema 
and almost any forms of two-dimensional tables are acceptable according to 
the HTML grammar . This relaxation complicates the process of retrieving 

hierarchical data from HTML tables. In this paper, we propose an 
automated approach for retrieving hierarchical data from HTML tables. 
The proposed approach constructs the content tree of an HTML table, which 
captures the intended hierarchy of the data content of the table, without 
requiring the internal structure of the table to be known beforehand. 
Also, the user of the content tree does not deal with HTML tags while 
retrieving the desired data from the content tree. Our approach can be 
employed by (i) a query language written for retrieving hierarchically 
structured data, extracted from either the contents of HTML tables or 
other sources, (ii) a processor for converting HTML tables to XML 
documents, and (iii) a data warehousing repository for collecting 
hierarchical data from HTML tables and storing materialized views of the 
tables. The time complexity of the proposed retrieval approach is 
proportional to the number of HTML elements in an HTML table. (Author 
abstract) 10 Refs. 
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Journal Announcement: 9709W3 

Abstract: Information retrieval is to retrieve relevant information 
that satisfies user's information needs. There arises a problem of how to 
select only information that is relevant to the user. Ranking techniques 
are used to find the documents in a collection of documents that are most 
likely to be relevant to the user's query . However, we find out that 
there could be retrieved documents whose contexts may not be consistent 
to the query . Mutual information is a measure which represents the 
relation between a word and another word. So, we will re-evaluate the 
relation between the terms in the retrieved document and the terms in the 

query . In this paper, we discuss a model of natural language 
information retrieval system that is based on a two-level document 
ranking method using mutual information. At the first-level, we retrieve 
documents based on automatically constructed index terms. At the 
second-level, we reorder the retrieved documents using mutual 
information. We will show that our method achieves considerable retrieval 
effectiveness improvement over a traditional linear searching method. 
Also, we will analyze seven newly developed formulas that reorder the 
retrieved documents. Among the seven formulas, we will recommend one 
formula that dominates the others in terms of the retrieval 
effectiveness. (Author abstract) 34 Refs. 
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ABSTRACT: The new Knowledge Interchange Format ( KIF ) and Knowledge 
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ABSTRACT: This paper discusses the representational and algorithmic power 
of the conceptual graph model for natural language semantics and 
knowledge processing. Also described is a Prolog-like resolution method for 
conceptual graphs. . . 
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...ABSTRACT: existing queries. It is aimed at assisting the user by 
supporting the logical and nonlinear conceptual structure of the 
search session. A comparison of the same INSPEC search is executed by 
means of the Islands... 
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... s choices 

The three technologies chosen by Darpa as the most promising in 
detecting and retrieving documents are context vectors, conceptual 
graphs , and inference networks . HNC is developing the context-vector 
approach, which was invented by its... 

...a verbal reaction, a history, previous related events and so forth," 
Liddy explained. 

Then the conceptual graph of a query is compared with the 
conceptual graphs of the documents, resulting in a 
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Abstract (Basic) : KR 2003068856 A 

NOVELTY - A device for extracting information requested by a user 
in an unstructured document and a method thereof are provided to 
extract only specific information using an event template including 
information of who, when, where, what, and how on condition that a user 
collects an unstructured document for extracting information. 

DETAILED DESCRIPTION - An extraction information specification 
unit (10) designates and inputs information to be extracted by a user. 
If a document for an information extraction ( 21 ) is inputted, an event 
template extraction unit (20) checks whether designated information is 
included in the extraction information specification unit (10) and 
extracts an event template of a sentence unit of a specific semantic 
argument structure. An event template integrating unit (30) integrates 
event templates created in the event template extraction unit (20) 
interactively in accordance with an agreement of a semantic argument 
and contents thereof. A template extracting unit (40) extracts a 
template including only information to be extracted by a user out of 
event templates integrated in the event template integrating unit (30) 
and stores the template in a database ( 4 1 ) . 
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Abstract (Basic) : US 20020107844 Al 

NOVELTY - An analyzer (13) analyzes sentence structure of natural 
language query from user with reference to stored language knowledge 
data, to generate semantic structure. The semantic structure is 
partitioned, indexed, stored to search and supply information which is 
most semantically relevant to user's query. A processor (14) has 
sentence form at rule based on which the failure data from the analyzer 
is corrected and is transferred to the user. 

DETAILED DESCRIPTION - INDEPENDENT CLAIMS are included, for the 
following: 

(1) Information generation and retrieval method; and 

(2) Computer-readable medium storing information generation and 
retrieval program. 

USE - Internet based information generation and retrieval 
apparatus. 

ADVANTAGE - Efficient information retrieval is performed relevant 
to the standardized formation and thereby improving information 
transaction and transport, thereby satisfying demand for transaction of 
information request from user. 

DESCRIPTION OF DRAWING (S) - The figure shows the block diagram of 
the information generation and retrieval apparatus. 

Analyzer (13) 

Processor (14) 
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